1. Introduction



A venerable principle holds that the Fourier transform of a measure is "small" in a 
meaningful sense when linear structure is absent, in certain circumstances. For instance: 

(1) If $\mu$ is supported on an appropriately curved submanifold of $\mathbb{R}^d$, then $\widehat{\mu}(\xi) \to 0$ at a certain rate as $|\xi| \to \infty$.

(2) If $\mu_\omega$ is a random measure, with appropriate properties, then for typical $\omega$, $\widehat{\mu_\omega}$ has small supremum norm; or in other contexts, $\widehat{\mu_\omega}(\xi)$ tends to zero at an appropriate rate as $|\xi| \to \infty$.

(3) Let $p$ be a large prime, and for $x \in \mathbb{Z}_p$ let $\mu_p(x) = 1$ if $x$ is a quadratic residue modulo $p$, and $\mu_p(x) = 0$ otherwise. Then with a natural normalization of the Fourier transform, $|\widehat{\mu_p}(\xi)| \le C p^{-1/2}$ for all $\xi \ne 0$, whereas $\widehat{\mu_p}(0) \asymp 1$.
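Example (3) can be checked numerically for small primes. The following is a minimal sketch, assuming the normalization $\widehat{\mu_p}(\xi) = p^{-1}\sum_x \mu_p(x)e^{-2\pi i x\xi/p}$ and counting only nonzero residues; both choices, and the function name, are assumptions made for illustration and are not fixed by the text.

```python
import cmath

def quadratic_residue_fourier(p):
    """Normalized Fourier coefficients of the indicator of the nonzero
    quadratic residues modulo an odd prime p (normalization is an assumption)."""
    residues = {(x * x) % p for x in range(1, p)}
    coeffs = []
    for xi in range(p):
        total = sum(cmath.exp(-2j * cmath.pi * x * xi / p) for x in residues)
        coeffs.append(total / p)
    return coeffs

for p in (11, 101, 1009):
    coeffs = quadratic_residue_fourier(p)
    largest_nonzero = max(abs(c) for c in coeffs[1:])
    # Compare the largest nonzero-frequency coefficient with p^(-1/2).
    print(p, abs(coeffs[0]), largest_nonzero, p ** -0.5)
```

The coefficient at $\xi = 0$ stays comparable to $1$, while every other coefficient is of size about $p^{-1/2}$.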

Smallness of the Fourier transform may be reformulated in terms of a bilinear expression via the identity $\|\widehat{\mu}\|_\infty = \sup_{f,g \ne 0} |\iint f(x)g(y)\,d\mu(x-y)| / \|f\|_2\|g\|_2$. This formulation suggests multilinear extensions, involving e.g. $\iint f(x)g(y)h(x+y)\,d\mu(x-y)$. While various possible inequalities can be considered, we are primarily interested in bounds in terms of $\|f\|_p\|g\|_q\|h\|_r$ with $p^{-1} + q^{-1} + r^{-1} = 1$; such quantities scale naturally from the perspective of ergodic theory.

If $G$ is a finite Abelian group and $\mu : G \to \mathbb{C}$ has $\|\widehat{\mu}\|_\infty \ll \|\mu\|_1$, under appropriate normalizations, then $\mu$ is sometimes said to be uniform [7]. There are higher-order notions of uniformity, due to Gowers [7], which have a multilinear character. Gowers uniformity is closely related to the type of smallness studied in this paper, but here we are dealing with rather singular measures.

In this paper we investigate the extension of this smallness principle to higher-degree multilinear expressions, for natural families of random measures. In §2 we give an example which demonstrates that linear structure is no longer the natural consideration. Indeed, for one of the most canonical (deterministic) examples of all, the natural trilinear extension satisfies no smallness condition, due to the presence of quadratic structure. In §3 we state our main results, which concern two classes of random measures. For one of these classes, our results are quite satisfactory, but for the other they apply only for a certain range of parameters which may not be optimal.

The author is indebted to Patrick LaVictoire for useful discussions. 

2. A Nonlinear Obstruction 

For convenience, the following example is given in the context of certain finite groups, rather than $\mathbb{Z}$; there are no essential differences. Let $d \ge 1$ and let $p \in \mathbb{N}$ be any prime. Let $\mathbb{Z}_p$ be the finite cyclic group $\mathbb{Z}/p\mathbb{Z}$. Let $G_p = \mathbb{Z}_p^d \times \mathbb{Z}_p$. For $x = (x_1, \dots, x_d) \in \mathbb{Z}_p^d$, we write $|x|^2 = \sum_{j=1}^d x_j^2$. Write $G_p \ni x = (x', x_{d+1}) \in \mathbb{Z}_p^d \times \mathbb{Z}_p$. Let $\mu_p$ be the function on $G_p$ defined by

$$\mu_p(x', x_{d+1}) = \begin{cases} p^{-d} & \text{if } x_{d+1} = |x'|^2, \\ 0 & \text{otherwise,} \end{cases}$$

let $m_p(x) = p^{-d-1}$ for all $x \in G_p$, and let $\nu_p = \mu_p - m_p$. The function $m_p$ satisfies $\widehat{m_p}(\xi) = \widehat{\mu_p}(\xi)$ for $\xi = 0$, and $\widehat{m_p}(\xi) = 0$ for all $\xi \ne 0$.

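Define the Fourier transform $\widehat{f}(\xi) = \sum_{x \in G_p} f(x)e^{-2\pi i \xi \cdot x/p}$, where $\xi \cdot x = \sum_{j=1}^{d+1} x_j\xi_j$ and $\xi \in G_p$. By a well-known identity for Gauss sums,

$$\max_{\xi \ne 0} |\widehat{\mu_p}(\xi)| = p^{-d/2},$$

and consequently

$$\max_{\xi} |\widehat{\nu_p}(\xi)| = p^{-d/2}.$$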

Therefore by Plancherel's identity, there is a bilinear inequality

$$\Big|\sum_{x,y \in G_p} f(x)g(y)\nu_p(x-y)\Big| \le p^{-d/2}\|f\|_2\|g\|_2 \tag{2.1}$$

where $\|\cdot\|_q$ denotes the $\ell^q(G_p)$ norm.
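The Gauss sum bound behind (2.1) is easy to test by brute force. The sketch below takes $d = 1$ and an odd prime $p$ (both assumptions made to keep the computation small), and uses the fact that $\widehat{\nu_p}(\xi) = \widehat{\mu_p}(\xi)$ for $\xi \ne 0$; the function name is hypothetical.

```python
import cmath

def max_nonzero_fourier_nu(p):
    """d = 1: largest |nu_p-hat(xi)| over xi != 0, where mu_p equals 1/p on
    the parabola x_2 = x_1^2 (mod p) and nu_p = mu_p - p^(-2)."""
    best = 0.0
    for a in range(p):
        for b in range(p):
            if (a, b) == (0, 0):
                continue
            # For xi != 0 the constant part m_p contributes nothing.
            s = sum(cmath.exp(-2j * cmath.pi * (a * x + b * x * x) / p)
                    for x in range(p)) / p
            best = max(best, abs(s))
    return best

for p in (5, 7, 13):
    print(p, max_nonzero_fourier_nu(p), p ** -0.5)
```

For each odd prime tested, the two printed values agree up to rounding, matching $p^{-d/2}$ with $d = 1$.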

Does (2.1) extend to a trilinear inequality

$$\Big|\sum_{x,y \in G_p} f(x)g(y)h(x+y)\nu_p(x-y)\Big| \le Cp^{-\rho}\|f\|_2\|g\|_2\|h\|_\infty \tag{2.2}$$

for some $\rho = \rho(d) > 0$ independent of $p$?

Observation 2.1. No inequality of the form (2.2) is valid.
To prove this, set

$$h(x) = e^{2\pi i|x'|^2/p}, \qquad f(x) = e^{2\pi i[x_{d+1} - 2|x'|^2]/p}, \qquad g(y) = e^{2\pi i[-y_{d+1} - 2|y'|^2]/p}.$$

Then $f(x)g(y)h(x+y) = e^{2\pi i\Phi(x,y)/p}$ where

$$\Phi(x,y) = |x'+y'|^2 + x_{d+1} - y_{d+1} - 2|x'|^2 - 2|y'|^2.$$

For $(x,y)$ in the support of $\mu_p(x-y)$, $x_{d+1} - y_{d+1} = |x'-y'|^2$ and consequently

$$\Phi(x,y) = |x'+y'|^2 + |x'-y'|^2 - 2|x'|^2 - 2|y'|^2 = 0.$$

Therefore the contribution of $\mu_p$ to our trilinear form equals

$$\sum_{x,y \in G_p} f(x)g(y)h(x+y)\mu_p(x-y) = \sum_{x,y:\ x_{d+1}-y_{d+1} = |x'-y'|^2} p^{-d} = p^{2d+1}p^{-d} = p^{d+1},$$

while

$$\|f\|_2\|g\|_2\|h\|_\infty = (p^{d+1})^{1/2} \cdot (p^{d+1})^{1/2} \cdot 1 = p^{d+1}.$$

On the other hand,

$$\sum_{x,y \in G_p} f(x)g(y)h(x+y)m_p(x-y) = p^{-d-1}\sum_{x,y \in G_p} e^{2\pi i\Phi(x,y)/p}.$$

For fixed $y, x'$, $\Phi((x',t),y)$ takes the form $c(x',y) + t$, and

$$\sum_{t \in \mathbb{Z}_p} e^{2\pi i(t + c(x',y))/p} = e^{2\pi ic(x',y)/p}\sum_{t \in \mathbb{Z}_p} e^{2\pi it/p} = 0.$$

Thus in all,

$$\sum_{x,y \in G_p} f(x)g(y)h(x+y)\nu_p(x-y) = \sum_{x,y \in G_p} f(x)g(y)h(x+y)\mu_p(x-y) = \|f\|_2\|g\|_2\|h\|_\infty;$$

there is no cancellation in the sum.
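The computation in Observation 2.1 can be confirmed directly for a small prime. The sketch below takes $d = 1$ (an assumption made for speed) and evaluates the trilinear form against $\nu_p$ by summing over all of $G_p \times G_p$; the function name is hypothetical.

```python
import cmath
from itertools import product

def trilinear_form(p):
    """d = 1: sum over x, y in G_p of f(x) g(y) h(x + y) nu_p(x - y)
    for the three quadratic phases chosen in the text."""
    def e(t):
        return cmath.exp(2j * cmath.pi * (t % p) / p)

    total = 0
    for x1, x2, y1, y2 in product(range(p), repeat=4):
        f = e(x2 - 2 * x1 * x1)
        g = e(-y2 - 2 * y1 * y1)
        h = e((x1 + y1) ** 2)
        # mu_p(x - y) is 1/p exactly when x2 - y2 = (x1 - y1)^2 mod p.
        on_parabola = (x2 - y2 - (x1 - y1) ** 2) % p == 0
        nu = (1 / p if on_parabola else 0.0) - p ** -2
        total += f * g * h * nu
    return total

for p in (3, 5):
    print(p, trilinear_form(p), p ** 2)
```

The sum equals $p^{d+1} = p^2$ up to rounding, which is also $\|f\|_2\|g\|_2\|h\|_\infty$, so no power of $p$ is gained.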

3. Results 

Our setting is the set $\mathbb{Z}$ of all integers, and we will work in terms of norms $L^p(\mathbb{Z}) = \ell^p$. Let $\Omega$ be a probability space, equipped with jointly independent, identically distributed, $\{0,1\}$-valued selector variables $\{s(\omega,x) : x \in \mathbb{Z}\}$, such that $s(\omega,x) = 1$ with probability $p$ and $= 0$ with probability $1-p$. Let $N$ be any large positive integer. Let $r(\omega,x) = (Np)^{-1}s(\omega,x) - N^{-1}$ for integers $x \in [-N,N]$, and $r(\omega,x) = 0$ otherwise. Thus $\mathbb{E}_\omega r(\omega,x) = p(Np)^{-1} - N^{-1} = 0$ for $x \in [-N,N]$.
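The centering of $r(\omega,\cdot)$ can be made concrete with exact rational arithmetic. This is a minimal sketch; the function names and the use of a seeded generator are assumptions made for illustration.

```python
from fractions import Fraction
import random

def centered_weights(N, p, rng):
    """One sample of r(omega, x) = s(omega, x)/(N p) - 1/N for x in [-N, N],
    where each selector s(omega, x) is 1 with probability p."""
    return {x: Fraction(int(rng.random() < p)) / (N * p) - Fraction(1, N)
            for x in range(-N, N + 1)}

def expected_weight(N, p):
    """Exact expectation of r(omega, x): the two outcomes weighted by p and 1 - p."""
    value_if_selected = 1 / (N * p) - Fraction(1, N)
    value_if_not = -Fraction(1, N)
    return p * value_if_selected + (1 - p) * value_if_not

N, p = 100, Fraction(1, 10)
sample = centered_weights(N, p, random.Random(0))
print(expected_weight(N, p), sorted(set(sample.values())))
```

Each sampled weight takes one of only two values, $-N^{-1}$ or $(Np)^{-1} - N^{-1}$, and the exact expectation is zero.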

Let $\{L_j : 0 \le j \le M\}$ be $\mathbb{Z}$-linear mappings from $\mathbb{Z}^2$ to $\mathbb{Z}$. Assume that none of the $L_j$ are scalar multiples, over $\mathbb{Q}$, of $(x,y) \mapsto x$, that none are scalar multiples of $(x,y) \mapsto y$, and that no $L_i$ is a scalar multiple of $L_j$ for $i \ne j$.

In Theorem 3.1 we study multilinear operators

$$T_\omega(f, g_1, \dots, g_M)(x) = \sum_y f(y)\,r(\omega, L_0(x,y))\prod_{j=1}^M g_j(L_j(x,y)). \tag{3.1}$$

These depend also on $N$, and we are interested in their properties as $N \to \infty$. Define the operator norm

$$\|T_\omega\|_{op} = \sup_{f, g_1, \dots, g_M} \|T_\omega(f, g_1, \dots, g_M)\|_2$$

where the supremum is taken over all functions satisfying $\|f\|_2 \le 1$ and $\|g_j\|_\infty \le 1$ for all $j$.

Theorem 3.1. Suppose that $M \ge 1$ and $0 < \gamma < 2^{-M}$. There exist $\varepsilon > 0$ and $C < \infty$ such that for all $N \ge 1$ and $p \ge N^{-\gamma}$,

$$\mathbb{E}_\omega\|T_\omega\|_{op} \le CN^{-\varepsilon}. \tag{3.2}$$

The constant $C$ is independent of $N$. We do not know whether the conclusion may hold for a larger range of exponents $\gamma$.

Of course

$$\mathbb{E}_\omega \sup_{f, g_1, \dots, g_M} \|T_\omega(f, g_1, \dots, g_M)\|_2 \ge \sup_{g_1, \dots, g_M} \mathbb{E}_\omega \sup_f \|T_\omega(f, g_1, \dots, g_M)\|_2.$$

The latter quantity is easier to analyze; see Proposition 4.1, which gives a satisfactory bound for all $\gamma < 1$, for all $M$.

An ergodic-theoretic consequence is as follows. Let $T$ be an invertible measure-preserving transformation on a probability space $(X,\mu)$. For each $\omega \in \Omega$, specify a subsequence $(n_k(\omega) : k = 1, 2, \dots)$ of the natural numbers, as follows. Let $\gamma \in (0,1)$. Let $\Omega$ be a probability space equipped with a family of jointly independent random variables $\{s_n(\omega) : n \in \mathbb{N}\}$ such that $s_n(\omega) = 1$ with probability $n^{-\gamma}$, and $s_n(\omega) = 0$ otherwise. For each $\omega \in \Omega$, specify the random subsequence $(n_k(\omega))_{k \in \mathbb{N}}$ to consist of all $n \in \mathbb{N}$ for which $s_n(\omega) = 1$, listed in increasing order.
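The construction of the random subsequence is simple to simulate. The sketch below draws one realization up to a finite cutoff; the cutoff `limit`, the seed, and the function name are assumptions made for illustration.

```python
import random

def random_subsequence(limit, gamma, seed=0):
    """Return the n in [1, limit] with s_n = 1, where the selectors s_n are
    independent and P(s_n = 1) = n^(-gamma). The list is already increasing."""
    rng = random.Random(seed)
    return [n for n in range(1, limit + 1) if rng.random() < n ** (-gamma)]

sequence = random_subsequence(limit=10_000, gamma=0.5)
# The expected number of terms up to L is of order L^(1 - gamma).
print(len(sequence), sequence[:10])
```

Smaller values of `gamma` give denser subsequences, and `gamma` close to $1$ gives very sparse ones.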

It has been proved p], [5], [6] that for all $f_1, \dots, f_M \in L^\infty(X)$,

$$\lim_{N \to \infty} N^{-1}\sum_{k=1}^N f_1(T^k(x))f_2(T^{2k}(x)) \cdots f_M(T^{Mk}(x)) \text{ exists in } L^1(X, d\mu(x)). \tag{3.3}$$

This fundamental result, together with Theorem 3.1, gives

Theorem 3.2. If $0 < \gamma < 2^{-M+1}$ then for almost every $\omega \in \Omega$, for all $f_1, \dots, f_M \in L^\infty(X)$,

$$\lim_{N \to \infty} N^{-1}\sum_{k=1}^N f_1(T^{n_k}(x))f_2(T^{2n_k}(x)) \cdots f_M(T^{Mn_k}(x)) \text{ exists in } L^1(X, d\mu(x)), \tag{3.4}$$

where $n_k = n_k(\omega)$.

A generalization of Theorem 3.1 is natural, and of interest. $e_\xi$ will denote the function $y \mapsto e^{-i\xi y}$. With the above notations, define

$$T^*_\omega(f, g_1, \dots, g_M)(x) = \sup_\xi \Big|\sum_y e^{-i\xi y}f(y)\,r(\omega, L_0(x,y))\prod_{j=1}^M g_j(L_j(x,y))\Big| = \sup_\xi |T_\omega(e_\xi f, g_1, \dots, g_M)(x)|.$$

Multiplying each function $g_j(z)$ by a factor $e^{-i\xi_j z}$, and taking the supremum over all $(\xi, \xi_1, \dots, \xi_M)$, introduces no additional generality since each $e^{-i\xi_j L_j(x,y)}$ can be factored as $e^{ia_jx}e^{-ib_jy}$ for appropriate coefficients $a_j, b_j$.

Theorem 3.3. For each $0 < \gamma < 2^{-M-1}$ there exist $\delta > 0$ and $C < \infty$ such that for all $N \ge 1$ and $p \ge N^{-\gamma}$,

$$\mathbb{E}_\omega\|T^*_\omega\|_{op} \le CN^{-\delta}. \tag{3.5}$$

The case $M = 0$ has an ergodic-theoretic consequence, for return times of sparse random subsequences.

Theorem 3.4 (Return Times). Let $(X, \mathcal{A}, \mu, \tau)$ be any dynamical system, such that $\mu$ is a probability measure and $(X, \mathcal{A}, \mu)$ is isomorphic to $[0,1]$ equipped with Lebesgue measure and the Lebesgue $\sigma$-algebra. Let $0 < \gamma < \frac12$. Let $\{n_k(\omega)\}$ be a random sequence, constructed as in Theorem 3.2. Let $p \in (1,\infty]$ and $q \ge 2$. Then for almost every $\omega \in \Omega$, the following holds. For each $f \in L^p(X)$ there exists a subset $X_0 \subset X$ of full measure such that for every dynamical system $(Y, \mathcal{F}, \nu, \sigma)$, every $g \in L^q(Y)$, and every $x \in X_0$,

$$\lim_{N \to \infty} N^{-1}\sum_{k=1}^N f(\tau^{n_k(\omega)}(x))\,g(\sigma^{n_k(\omega)}(y)) \text{ exists for } \nu\text{-almost every } y \in Y.$$

Thus far we have considered random variables which depend only on $L(x,y)$ for some linear function $L$. Next, we consider analogous results for random matrices $(r_\omega(x,y))_{x,y}$, with all entries mutually independent. Consider jointly independent random selector variables $s_\omega(x,y)$ for $(x,y) \in [-N, \dots, N]^2$, satisfying $s_\omega(x,y) = 1$ with probability $p$, and $= 0$ otherwise. Then $\mathbb{E}(\sum_x s_\omega(x,y)) \asymp Np$ and $\mathbb{E}(\sum_y s_\omega(x,y)) \asymp Np$. Define $r_\omega(x,y) = (Np)^{-1}(s_\omega(x,y) - p)$ so that $\mathbb{E}_\omega r_\omega(x,y) = 0$.
Theorem 3.5. Let $M \ge 2$ and $0 < \gamma < 1$. For any $\{L_j : 0 \le j \le M\}$ satisfying the hypotheses of Theorem 3.1 and for any $\varepsilon > 0$ there exists $C_{M,\varepsilon} < \infty$ such that for all $N \ge 1$ and all $p \ge N^{-\gamma}$, the multilinear forms

$$\mathcal{T}_\omega(f_1, \dots, f_M) = \sum_{x,y} r_\omega(x,y)\prod_{j=1}^M f_j(L_j(x,y))$$

satisfy

$$\mathbb{E}_\omega\|\mathcal{T}_\omega\|_{op} \le C_{M,\varepsilon}N^{\varepsilon}N^{-(1-\gamma)/2}. \tag{3.6}$$

In this formulation, $\mathcal{T}_\omega(f_1, \dots, f_M)$ is a complex number, not a function. It is possible to generalize Theorem 3.5 by incorporating factors $e^{-i\xi y}$, with a supremum over all $\xi$, parallel to Theorem 3.3.

The conclusion of Theorem 3.5 fails to hold for $\gamma > 1$. The method of proof of Theorem 3.1 applies only in the restricted range $\gamma < 2^{-(M-2)}$, and with some added complications since the Fourier transform cannot be applied directly. However, our proof for the full range $\gamma < 1$ proceeds along quite different lines, relying on entropy considerations along with large deviations bounds.

4. A Preliminary Bound 

The order of quantifiers in Theorem 3.1 is significant. In this preliminary section we discuss a variant in which the supremum in the definition of $\|T_\omega\|_{op}$ following (3.1) is taken only over $f$, with $g_1, \dots, g_M$ fixed. For this variant, and even for a substantial generalization, more complete results can be obtained, by a simpler method.

Generalize $T_\omega$ by considering linear operators

$$L_{\omega,h}(f)(x) = \sum_y f(y)\,r(\omega, x-y)\,h(x,y), \tag{4.1}$$

where $h \in \ell^\infty(\mathbb{Z}^2)$ is an arbitrary bounded function of two variables. In particular, this includes the case where $h(x,y) = \prod_{j=1}^M g_j(L_j(x,y))$, for arbitrary $M$ and $g_j \in \ell^\infty$.

Let $\Omega, p, N, s(\omega,\cdot), r(\omega,x)$ be as in Theorem 3.1. Regard $L_{\omega,h}$ as a linear operator on $L^2([-N,N])$.

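Proposition 4.1. For any $\varepsilon > 0$, there exists $C_\varepsilon < \infty$ such that for every $h \in \ell^\infty$,

$$\mathbb{E}_\omega\|L_{\omega,h}\|_{op} \le C_\varepsilon N^{\varepsilon}(Np)^{-1/2}\|h\|_\infty. \tag{4.2}$$

Proof. Fix $N$. Denote by trace the trace of a self-adjoint linear operator on $\ell^2([-N,N])$. Fix $h$, and write $L_\omega = L_{\omega,h}$. Since

$$\mathbb{E}_\omega\|L_\omega\|_{op} \le \big(\mathbb{E}_\omega\|(L_\omega^*L_\omega)^q\|_{op}\big)^{1/2q} \le \big(\mathbb{E}_\omega\operatorname{trace}[(L_\omega^*L_\omega)^q]\big)^{1/2q},$$

it therefore suffices to show that for any positive integer $q$,

$$\mathbb{E}_\omega\operatorname{trace}(L_\omega^*L_\omega)^q \le C_qN \cdot (Np)^{-q}\|h\|_\infty^{2q}. \tag{4.3}$$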

Write $\vec{n} = (n_1, \dots, n_{2q})$ where $n_j \in [-N,N]$ are arbitrary. Define $n_{2q+1} = n_1$. All sums over $\vec{n}$ written below are understood to be taken over all such vectors $\vec{n} \in [-N,N]^{2q}$. We say that $\vec{n}$ is admissible if in the vector

$$\vec{m} = (n_2 - n_1,\ n_2 - n_3,\ n_4 - n_3,\ n_4 - n_5,\ n_6 - n_5,\ n_6 - n_7,\ \dots,\ n_{2q} - n_{2q+1}),$$

no integer appears as a coordinate exactly once. We write $\sum^\dagger$ to denote the sum over all admissible $\vec{n}$.
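The admissibility condition is purely combinatorial and can be enumerated for small parameters. The following sketch (function names are hypothetical) builds the difference vector with the cyclic convention $n_{2q+1} = n_1$ and tests whether any value occurs exactly once.

```python
from collections import Counter
from itertools import product

def difference_vector(n):
    """m = (n2 - n1, n2 - n3, n4 - n3, n4 - n5, ...), with n_{2q+1} = n_1."""
    extended = list(n) + [n[0]]
    m = []
    for i in range(1, len(n) + 1):           # i is the 1-based index
        n_i, n_next = extended[i - 1], extended[i]
        m.append(n_next - n_i if i % 2 == 1 else n_i - n_next)
    return m

def is_admissible(n):
    """True when no integer appears exactly once among the coordinates of m."""
    return all(count >= 2 for count in Counter(difference_vector(n)).values())

N, q = 2, 2
admissible = [n for n in product(range(-N, N + 1), repeat=2 * q) if is_admissible(n)]
print(len(admissible), "admissible vectors out of", (2 * N + 1) ** (2 * q))
```

Since every value of an admissible difference vector occurs at least twice among $2q$ coordinates, the number of distinct values is at most $q$, which is the bound $K \le q$ used in the counting argument.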
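With this notation, the trace can be expanded in the form

$$\operatorname{trace}(L_\omega^*L_\omega)^q = \sum_{\vec{n}} H(\vec{n})\prod_{i=1}^{2q} r^*(\omega, n_{i+1} - n_i)$$

where $r^*(\omega, n_{i+1} - n_i) = r(\omega, n_{i+1} - n_i)$ if $i$ is odd, and $= r(\omega, n_i - n_{i+1})$ if $i$ is even. Here $H(\vec{n})$ is a product of $2q$ factors of $h$, so $\|H\|_{\ell^\infty} \le \|h\|_\infty^{2q}$. Moreover,

$$\mathbb{E}_\omega\operatorname{trace}(L_\omega^*L_\omega)^q = \sum\nolimits^\dagger_{\vec{n}} H(\vec{n})\,\mathbb{E}_\omega\prod_{i=1}^{2q} r^*(\omega, n_{i+1} - n_i), \tag{4.4}$$

since

$$\mathbb{E}_\omega\prod_{i=1}^{2q} r^*(\omega, n_{i+1} - n_i) = 0$$

by independence whenever $\vec{n}$ is not admissible.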

If $\vec{n}$ is admissible, then the number $K$ of pairwise distinct coordinates of $\vec{m}(\vec{n}) = (n_2 - n_1, n_2 - n_3, \dots)$ satisfies $K \le q$. Fix any $K \in [1,q]$. The number of $\vec{m} = (m_1, \dots, m_{2q}) \in [-2N,2N]^{2q}$ having exactly $K$ pairwise distinct coordinates is $\le C_qN^K$. The number of such $\vec{m}$ possessing the additional property that $m_1 - m_2 + m_3 - m_4 + \cdots = 0$ is of course no greater. The number of $\vec{n} \in [-N,N]^{2q}$ for which $\vec{m}(\vec{n})$ has exactly $K$ distinct coordinates is therefore $\le C_qN^{K+1}$; one additional power of $N$ arises, because $\vec{n}$ is determined by $\vec{m}(\vec{n})$ together with $n_1$, though not by $\vec{m}(\vec{n})$ alone.

If $\vec{m}(\vec{n})$ has $K$ pairwise distinct coordinates, then

$$\mathbb{E}_\omega\prod_{i=1}^{2q}|r^*(\omega, n_{i+1} - n_i)| \le C_q(Np)^{-2q}p^K.$$

Therefore the total contribution made to (4.4) by all admissible indices $\vec{n}$ having $K$ pairwise distinct coordinates is

$$\le C_qN \cdot (Np)^{-2q}N^Kp^K \le C_qN \cdot (Np)^{q-2q} = C_qN \cdot (Np)^{-q}$$

since $Np \ge 1$ and $K \le q$. Summing over all $K$ gives (4.3). □

5. Reduction of degree of multilinearity 

The proof of Theorem 3.1 will proceed by descending induction on the degree of multilinearity, $M$. In this section we set up a simple lemma which implements the inductive step.

It will be useful to reformulate and to modestly generalize the operators $T_\omega$. Consider a scalar-valued multilinear form

$$T(f_1, \dots, f_M, \rho) = \sum_{(x,y) \in [-AN,AN]^2}\ \prod_{j=1}^M f_j(L_j(x,y))\,\rho(L_0(x,y)) \tag{5.1}$$

where $M \ge 2$, $f_j : \mathbb{Z} \to \mathbb{R}$, each $L_j : \mathbb{Z}^2 \to \mathbb{Q}$ is $\mathbb{Q}$-linear, $L_i$ is not a scalar multiple, over $\mathbb{Q}$, of $L_j$ if $i \ne j$, and $\rho : \mathbb{Z} \to \mathbb{R}$. We operate under the convention that $f_j(L_j(x,y))$ is to be interpreted as $0$ whenever $L_j(x,y) \in \mathbb{Q} \setminus \mathbb{Z}$, and likewise for $\rho(L_0(x,y))$. Moreover, all $f_j$ and $\rho$ are supported in $[-AN,AN]$. Here $A \ge 1$ is any positive integer, which is initially $1$ but will increase in a controlled manner with each inductive step. We seek to bound $T(f_1, \dots, f_M, \rho)$ by a suitable constant times $\|f_1\|_2\|f_2\|_2\prod_{j>2}\|f_j\|_\infty$. This suitable constant will depend on $A$, in a manner which will not be specified. In the application, $\rho$ will depend on $\omega \in \Omega$ and will be constructed from $r(\omega,\cdot)$ in a recursive manner.
By assumption, $\mathbb{Z}^2 \ni (x,y) \mapsto (L_1(x,y), L_2(x,y))$ is injective, and has range equal to a lattice of rank $2$. Make a linear "change of variables" $(x,y) \mapsto (u,v) = \lambda(L_1(x,y), L_2(x,y))$ where $0 \ne \lambda \in \mathbb{Z}$ is chosen so that $\lambda L_i(x,y) \in \mathbb{Z}$ for $i = 1,2$ for all $(x,y) \in \mathbb{Z}^2$. The range of $\lambda L_1$ need not be arranged to be all of $\mathbb{Z}$; set $f_1 = 0$ at all integers not in this range, and likewise $f_2 = 0$ at all integers not in the range of $\lambda L_2$, and for $j > 2$, $f_j = 0$ and $\rho = 0$ at all appropriate points so that $T(f_1, \dots, f_M, \rho)$ may be rewritten as

$$T(f_1, \dots, f_M, \rho) = \sum_{x,y} \rho(L_0(x,y))f_1(x)f_2(y)\prod_{3 \le j \le M} f_j(L_j(x,y));$$

$\prod_{j \ge 3} f_j(L_j(x,y))$ is interpreted as $1$ if $M = 2$. The sum is now over $(x,y) \in [-AN,AN]^2$ for a possibly increased value of $A$. The functions $f_j$ and linear functionals $L_j, L_0$ appearing here are not the same as those in (5.1), but the new functionals continue to satisfy all hypotheses, and the new functions $f_j$ have all $L^p$ norms equal to the corresponding norms of the old functions $f_j$.
By Cauchy-Schwarz,

$$|T(f_1, \dots, f_M, \rho)|^2 \le \|f_1\|_2^2\sum_x\sum_{y,y'} \rho(L_0(x,y))\rho(L_0(x,y'))\prod_{j \ge 2} f_j(L_j(x,y))f_j(L_j(x,y'))$$

where $x, y, y'$ are all restricted to $[-AN,AN]$. Substitute $y' = y + z$ to reexpress the triple sum as

$$\sum_z\sum_{x,y} \rho^z(L_0(x,y))\prod_{j \ge 2} f_j^z(L_j(x,y))$$

where $L_2(x,y) = y$, $z \in [-2AN, 2AN]$,

$$f_j^z(u) = f_j(u)f_j(u + L_j(0,z)),$$

and

$$\rho^z(u) = \rho(u)\rho(u + L_0(0,z)).$$

Thus

$$|T(f_1, \dots, f_M, \rho)|^2 \le \|f_1\|_2^2\sum_z |T^z(f_2^z, \dots, f_M^z, \rho^z)|$$

where

$$T^z(f_2^z, \dots, f_M^z, \rho^z) = \sum_{x,y} \rho^z(L_0(x,y))\prod_{j=2}^M f_j^z(L_j(x,y))$$

takes the same form as did $T$, with the primary change that the number of functions $f_j$ has been reduced by one.
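The differencing step can be sanity-checked numerically. The sketch below uses $M = 3$ with the hypothetical choices $L_0(x,y) = x - y$, $L_2(x,y) = y$, $L_3(x,y) = x + y$ and random real-valued functions; these forms, the supports, and the function name are assumptions made for illustration, not the general setting of the text.

```python
import random

def differencing_bound(R=4, seed=1):
    """Return (|T|^2, ||f1||_2^2 * sum_z |T^z|) for one random instance, with
    T = sum_{x,y} rho(x - y) f1(x) f2(y) f3(x + y) over [-R, R]^2."""
    rng = random.Random(seed)

    def rand_fn(radius):
        return {u: rng.uniform(-1.0, 1.0) for u in range(-radius, radius + 1)}

    f1, f2 = rand_fn(R), rand_fn(R)
    f3, rho = rand_fn(2 * R), rand_fn(2 * R)
    val = lambda fn, u: fn.get(u, 0.0)        # functions vanish off their support
    L0 = lambda x, y: x - y
    L3 = lambda x, y: x + y
    box = range(-R, R + 1)

    T = sum(val(rho, L0(x, y)) * f1[x] * f2[y] * val(f3, L3(x, y))
            for x in box for y in box)

    def T_z(z):
        total = 0.0
        for x in box:
            for y in box:
                rho_z = val(rho, L0(x, y)) * val(rho, L0(x, y) + L0(0, z))
                f2_z = val(f2, y) * val(f2, y + z)
                f3_z = val(f3, L3(x, y)) * val(f3, L3(x, y) + L3(0, z))
                total += rho_z * f2_z * f3_z
        return total

    norm_f1_sq = sum(v * v for v in f1.values())
    reduced = [T_z(z) for z in range(-2 * R, 2 * R + 1)]
    return T * T, norm_f1_sq * sum(abs(t) for t in reduced)

lhs, rhs = differencing_bound()
print(lhs, "<=", rhs)
```

Each $T^z$ involves one fewer function than $T$, at the price of replacing $\rho$ by the product $\rho^z$.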

Now $f_2^z(L_2(x,y)) = f_2(y)f_2(y+z)$, so

$$\sum_z \|f_2^z\|_2 \le CN^{1/2}\|f_2\|_2^2 \tag{5.2}$$

by Cauchy-Schwarz. Likewise since all functions are supported in $[-AN,AN]$,

$$\|f_3^z\|_2 \le CN^{1/2}\|f_3\|_\infty^2. \tag{5.3}$$

Certain values of the parameter $z$ are exceptional, and will be treated as follows. By Cauchy-Schwarz,

$$\|f_2^z\|_1 \le \|f_2\|_2^2$$

for all $z$. Likewise $\|\rho^z\|_1 \le \|\rho\|_2^2$. Therefore for any $z$,

$$\begin{aligned}
|T^z(f_2^z, \dots, f_M^z, \rho^z)| &\le \|f_2^z\|_1\prod_{j>2}\|f_j^z\|_\infty\ \sup_y\sum_x |\rho^z(L_0(x,y))| \\
&\le \|f_2\|_2^2\prod_{j>2}\|f_j\|_\infty^2\ \sup_y\sum_x |\rho^z(L_0(x,y))| \\
&\le \|f_2\|_2^2\prod_{j>2}\|f_j\|_\infty^2\,\|\rho^z\|_1 \\
&\le \|f_2\|_2^2\prod_{j>2}\|f_j\|_\infty^2\,\|\rho\|_2^2.
\end{aligned}$$

Define

$$\|T(\rho)\|_{op} = \sup |T(f_1, \dots, f_M, \rho)|$$

where the supremum is taken over all functions satisfying $\|f_j\|_2 \le 1$ for $j \in \{1,2\}$ and $\|f_j\|_\infty \le 1$ for $j > 2$. Similarly

$$\|T^z(\rho^z)\|_{op} = \sup |T^z(f_2, \dots, f_M, \rho^z)|$$

where the supremum is taken over all functions satisfying $\|f_j\|_2 \le 1$ for $j \in \{2,3\}$ and $\|f_j\|_\infty \le 1$ for $j > 3$.

Write $|B|$ to denote the cardinality of a set $B$. We have shown:

Lemma 5.1. For any set $B \subset \mathbb{Z}$,

$$\|T(\rho)\|_{op} \le CN^{1/2}\sup_{z \notin B}\|T^z(\rho^z)\|_{op}^{1/2} + |B|^{1/2} \cdot \|\rho\|_2. \tag{5.4}$$

Remark 5.1. It may be helpful to understand the role of the different terms here, and the question of whether there is any essential loss when Lemma 5.1 is applied. The factors of $N^{1/2}$ in (5.2) and (5.3) are natural, cannot be improved, and represent no loss. Indeed, when $\rho(x) \asymp N^{-1}$ for all $x \in [-N,N]$, $\|T^z(\rho^z)\|_{op} = O(N^{-1})$ for all $z$, compensating exactly for the leading factor of $N^{1/2}$ in (5.4); thus this factor does not in and of itself represent any loss. As the support of $\rho$ becomes sparser, $\|\rho\|_\infty$ becomes large in order to maintain the normalization $\|\rho\|_1 \asymp 1$. Since $\rho^z$ is a product of two factors of $\rho$, $\|\rho^z\|_\infty$ becomes larger for many values of $z$, essentially by a factor of $N^2\|\rho\|_\infty^2$ relative to the non-sparse averaging case. But for this loss there is also compensation; the support of $\rho^z(x) = \rho(x)\rho(x+z)$ is, on the average with respect to $z$, correspondingly smaller than the support of $\rho$. For natural classes of random probability measures $\rho_\omega$, a simple back-of-the-envelope calculation gives heuristically that $N^{1/2}\|T^z(\rho^z)\|_{op}^{1/2} = O(1)$ for typical $z, \omega$, provided that the support of $\rho$ has cardinality $\gg N^{1/2}$. Thus one may expect to have no essential loss in applying Lemma 5.1 when dealing with random $\rho$ whose supports are not too small.

However, if the support of $\rho$ has cardinality $\ll N^{1/2}$, then $\rho^z$ will vanish identically for most values of $z$, and Lemma 5.1 must yield poor bounds. It is this issue which leads to the restrictions on $\gamma$ in our main theorems.

In our application of Lemma 5.1, $\rho$ will take the form

$$\rho(x) = \rho_\omega(x) = \prod_{i \in I} r(\omega, x + z_i) \tag{5.5}$$

where $I$ is some finite index set, and it will always be the case that

$$z_i \ne z_j \text{ whenever } i \ne j.$$

Then

$$\rho^z(x) = \prod_{i \in I} r(\omega, x + z_i)\,r(\omega, x + z_i + L(0,z)).$$

Here $z \mapsto L(0,z)$ is injective. In this situation, we define the set $B$ of exceptional values of the parameter $z$ to be

$$B = \{z : \text{there exist } i,j \in I \text{ such that } z_i = z_j + L(0,z)\}. \tag{5.6}$$

Then

$$|B| \le |I|^2,$$

since $z \mapsto L(0,z)$ is injective. Moreover, if $z \notin B$, $\rho^z$ takes the same form as did $\rho$, with the size of $I$ increased; $B$ is defined so that the distinctness condition $z_i \ne z_j$ is inherited from $\rho$ by $\rho^z$. With this definition of $B$, then,

$$\|T(\rho)\|_{op} \le CN^{1/2}\max_{z \notin B}\|T^z(\rho^z)\|_{op}^{1/2} + C\|\rho\|_2 \tag{5.7}$$

where $C$ depends only on $A, |I|$.
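The exceptional set (5.6) is small and explicit. The sketch below assumes $L(0,z) = cz$ for a nonzero integer `slope` $c$ (a hypothetical concrete form of the injective map) and checks that, away from $B$, the doubled family of shifts is again pairwise distinct.

```python
def exceptional_set(shifts, slope):
    """B = {z : z_i = z_j + slope * z for some i, j}, where L(0, z) = slope * z
    and slope is a nonzero integer (an assumption made for this example)."""
    B = set()
    for zi in shifts:
        for zj in shifts:
            diff = zi - zj
            if diff % slope == 0:
                B.add(diff // slope)
    return B

def doubled_shifts(shifts, slope, z):
    """Shifts appearing in rho^z: the original z_i together with z_i + L(0, z)."""
    return list(shifts) + [s + slope * z for s in shifts]

shifts, slope = [0, 1, 5], 2
B = exceptional_set(shifts, slope)
print(sorted(B), len(B) <= len(shifts) ** 2)
for z in range(-10, 11):
    if z not in B:
        new = doubled_shifts(shifts, slope, z)
        assert len(set(new)) == len(new)   # distinctness is inherited
```

Note that $z = 0$ always belongs to $B$ (take $i = j$), which is why $B_M = \{0\}$ appears at the top level of the induction in the next section.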

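6. Proof of Theorem 3.1

Let $\rho$ be of the form (5.5). Then

$$\|\rho_\omega\|_2^2 = \sum_x\prod_{i \in I} r(\omega, x + z_i)^2.$$

For each $x$, the $|I|$ factors $r^2(\omega, x + z_i)$ are jointly independent since $\{z_i\}$ are distinct. By definition,

$$\mathbb{E}(r(\cdot,x)^2) \le C(Np)^{-2}p = CN^{-2}p^{-1}$$

for some constant $C < \infty$. Therefore

$$\mathbb{E}\Big(\prod_{i \in I} r(\omega, x + z_i)^2\Big) \le C^{|I|}N^{-2|I|}p^{-|I|}$$

and hence

$$\mathbb{E}_\omega(\|\rho_\omega\|_2^2) \le C^{|I|}N^{1-2|I|}p^{-|I|}. \tag{6.1}$$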

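A stronger result will be required. The supremum $\sup_z$ in the next lemma is taken over all $|I|$-tuples $z = (z_1, \dots, z_{|I|})$ satisfying $z_i \ne z_j$ whenever $i \ne j$.

Lemma 6.1. Let $\rho_{\omega,z}(x) = \prod_{i \in I} r(\omega, x + z_i)$ where $z_i \ne z_j$ whenever $i \ne j$. Then for any $q < \infty$ there exists $C_q < \infty$ independent of $z$, such that for every $\xi \in \mathbb{T}$,

$$\mathbb{E}_\omega\big(|\widehat{\rho_{\omega,z}}(\xi)|^q\big) \le C_q\big(N^{-|I|+\frac12}p^{-|I|/2}\big)^q. \tag{6.2}$$

Moreover, for any $\varepsilon > 0$ there exists $C_\varepsilon < \infty$ such that

$$\mathbb{E}_\omega\Big(\sup_z \|\widehat{\rho_{\omega,z}}\|_{L^\infty(\mathbb{T})}\Big) \le C_{\varepsilon,|I|}N^{-|I|+\frac12+\varepsilon}p^{-|I|/2}. \tag{6.3}$$

The proof will be given below. By Parseval's theorem, (6.3) implies

$$\mathbb{E}_\omega\Big(\sup_z \|\rho_{\omega,z}\|_2\Big) \le C_{\varepsilon,|I|}N^{-|I|+\frac12+\varepsilon}p^{-|I|/2}. \tag{6.4}$$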

We are now in a position to argue by induction on the degree of multilinearity $M$. Some additional notation is required, because the base case in the induction depends on $M$. Let $\mathcal{T}(\omega) = \mathcal{T}_M(\omega)$ be the $M$-linear scalar-valued form to be analyzed; thus $\rho = r$. Define $B_{M+1} = \emptyset$, and $B_M = \{0\} \subset \mathbb{Z}$. For $z \notin B_M$ define $\mathcal{T}_{M-1}(\omega,z)$ to be the associated $(M-1)$-linear scalar form, as discussed above. Define $B_{M-1}$ to be the set of all $(z_1,z_2) \in \mathbb{Z}^2$ such that $z_1 \notin B_M$ and $z_2$ lies in the finite exceptional set $B$ associated to $z_1$ in the above discussion. For $(z_1,z_2) \notin B_{M-1}$ let $\mathcal{T}_{M-2}(\omega,(z_1,z_2))$ be the associated $(M-2)$-linear scalar form. Continue, constructing $\mathcal{T}_{M-k}(\omega,z)$ for $k = 0, 1, 2, \dots, M-2$ for (most) $z \in \mathbb{Z}^k$, and exceptional sets $B_{M-k} \subset \mathbb{Z}^{k+1}$. For each $z \notin B_{M-k}$, $\{\zeta : (z,\zeta) \in B_{M-k-1}\}$ is a finite set whose cardinality is bounded by a constant which depends only on $k$. By (5.7),

$$\|\mathcal{T}_k(\omega,(z_1, \dots, z_{M-k}))\|_{op} \le CN^{1/2}\max_{\zeta : (z,\zeta) \notin B_k}\|\mathcal{T}_{k-1}(\omega,(z_1, \dots, z_{M-k},\zeta))\|_{op}^{1/2} + C\|\rho_{\omega,(z_1, \dots, z_{M-k})}\|_2. \tag{6.5}$$

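Lemma 6.2. Suppose that $p \ge N^{-1}$. Then for any $\varepsilon > 0$ and $k \in \{2, 3, \dots, M\}$,

$$\mathbb{E}_\omega\Big(\sup_{z \notin B_{k+1}}\|\mathcal{T}_k(\omega,z)\|_{op}\Big) \le C_\varepsilon N^{1+\varepsilon}N^{-2^{1-k}}N^{-2^{M-k}}p^{-2^{M-k-1}}. \tag{6.6}$$

Specializing this conclusion to $k = M$ yields the sought-for bound.

Corollary 6.3. Provided that $p \ge N^{-1}$,

$$\mathbb{E}_\omega(\|\mathcal{T}(\omega)\|_{op}) \le C_\varepsilon N^{\varepsilon}N^{-2^{1-M}}p^{-1/2}. \tag{6.7}$$

If $p \ge N^{-\gamma}$ and if $\gamma < 2^{-(M-2)}$ then

$$\mathbb{E}_\omega(\|\mathcal{T}(\omega)\|_{op}) = O(N^{-\delta}) \text{ for any } \delta < \tfrac12\big(2^{-(M-2)} - \gamma\big). \tag{6.8}$$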

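Proof of Lemma 6.2. We proceed by ascending induction on $k$. $\mathcal{T}_k(\omega,z)$ is associated to an index set $I$ of cardinality $|I| = 2^{M-k}$. In the base case $k = 2$, $\mathcal{T}_2(\omega,z)$ is the bilinear form associated to a linear operator defined, in appropriate coordinates, by convolution with $\rho_{\omega,z}$. $\|\mathcal{T}_2(\omega,z)\|_{op}$ is simply the $L^2(\mathbb{Z}) \to L^2(\mathbb{Z})$ operator norm of this convolution operator, which is the $L^\infty$ norm of the Fourier transform $\widehat{\rho_{\omega,z}}$. Therefore by Lemma 6.1,

$$\begin{aligned}
\mathbb{E}_\omega\sup_{z \notin B_3}\|\mathcal{T}_2(\omega,z)\|_{op} &\le C_\varepsilon N^{\varepsilon}N^{1/2}N^{-|I|}p^{-|I|/2} \\
&= C_\varepsilon N^{\varepsilon}N^{1/2}N^{-2^{M-2}}p^{-2^{M-3}} \\
&= C_\varepsilon N^{1+\varepsilon}N^{-1/2}N^{-2^{M-2}}p^{-2^{M-3}} \\
&= C_\varepsilon N^{1+\varepsilon}N^{-2^{1-k}}N^{-2^{M-k}}p^{-2^{M-k-1}}
\end{aligned}$$

which is the bound stated for $k = 2$.

For the inductive step,

$$\begin{aligned}
\mathbb{E}_\omega\max_z\|\mathcal{T}_k(\omega,z)\|_{op} &\le CN^{1/2}\,\mathbb{E}_\omega\max_{(z,\zeta)}\|\mathcal{T}_{k-1}(\omega,(z,\zeta))\|_{op}^{1/2} + C\,\mathbb{E}_\omega\max_z\|\rho_{\omega,z}\|_2 \\
&\le C_\varepsilon N^{1/2}\Big(N^{1+\varepsilon}N^{-2^{2-k}}N^{-2^{M-k+1}}p^{-2^{M-k}}\Big)^{1/2} + C\,\mathbb{E}_\omega\max_z\|\rho_{\omega,z}\|_2 \\
&\le C_\varepsilon N^{1+\varepsilon}N^{-2^{1-k}}N^{-2^{M-k}}p^{-2^{M-k-1}} + C\,\mathbb{E}_\omega\max_z\|\rho_{\omega,z}\|_2.
\end{aligned}$$

The first term on the final line is of the desired form. By Lemma 6.1,

$$\mathbb{E}_\omega\max_z\|\rho_{\omega,z}\|_2 \le C_\varepsilon N^{\varepsilon}N^{-|I|+\frac12}p^{-|I|/2}$$

where $|I| = 2^{M-k}$. Thus

$$\mathbb{E}_\omega\max_z\|\rho_{\omega,z}\|_2 \le C_\varepsilon N^{\frac12+\varepsilon}N^{-2^{M-k}}p^{-2^{M-k-1}} \le C_\varepsilon N^{1+\varepsilon}N^{-2^{1-k}}N^{-2^{M-k}}p^{-2^{M-k-1}}$$

since $k \ge 2$. This completes the inductive step. □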
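The exponent bookkeeping in this induction can be verified mechanically. The sketch below tracks the powers of $N$ and $p$ in (6.6), ignoring the $N^{\varepsilon}$ factor, and checks that one application of (6.5) (a factor $N^{1/2}$ times a square root) maps the bound for $k-1$ to the bound for $k$; the function names are hypothetical.

```python
from fractions import Fraction

def bound_exponents(M, k):
    """Exponents (a, b) of N^a p^b in (6.6), omitting the N^epsilon factor."""
    a = 1 - Fraction(2) ** (1 - k) - Fraction(2) ** (M - k)
    b = -Fraction(2) ** (M - k - 1)
    return a, b

def inductive_step(a_prev, b_prev):
    """Effect of N^(1/2) * (N^a p^b)^(1/2) on the exponent pair (a, b)."""
    return Fraction(1, 2) + a_prev / 2, b_prev / 2

for M in range(2, 8):
    for k in range(3, M + 1):
        assert inductive_step(*bound_exponents(M, k - 1)) == bound_exponents(M, k)
    # At k = M the bound reduces to N^(-2^(1-M)) p^(-1/2), as in (6.7).
    print(M, bound_exponents(M, M))
```

With $p = N^{-\gamma}$ the final bound is $N^{-2^{1-M} + \gamma/2}$, which decays exactly when $\gamma < 2^{-(M-2)}$.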

Proof of Lemma 6.1. It suffices to treat the case where $q$ is an even positive integer. Thus we may replace $q$ by $2q$. For any $\xi \in \mathbb{T}$,

$$\begin{aligned}
\mathbb{E}_\omega\big(|\widehat{\rho_{\omega,z}}(\xi)|^{2q}\big) &= \mathbb{E}_\omega\sum_{n_1, \dots, n_q}\ \sum_{n'_1, \dots, n'_q}\ \prod_{\alpha=1}^q \rho_{\omega,z}(n_\alpha)\prod_{\beta=1}^q \rho_{\omega,z}(n'_\beta)\,e^{-i\xi(\sum_\alpha n_\alpha - \sum_\beta n'_\beta)} \\
&\le \sum_{n_1, \dots, n_{2q}}\Big|\mathbb{E}_\omega\prod_{\alpha=1}^{2q} \rho_{\omega,z}(n_\alpha)\Big|,
\end{aligned}$$

where $n_\alpha = n'_{\alpha-q}$ for $\alpha > q$.

For $m \in \mathbb{Z}$ and $\vec{n} = (n_1, \dots, n_{2q}) \in \mathbb{Z}^{2q}$, define $\nu(m,\vec{n})$ to be the number of indices $(\alpha,i) \in \{1, 2, \dots, 2q\} \times I$ which satisfy $n_\alpha + z_i = m$. Since $z_i \ne z_j$ whenever $i \ne j$, there can be at most one such pair with a given value of $\alpha$. For fixed $k$, the $|I|$ random variables $r(\omega, k + z_i)$ are jointly independent and $\mathbb{E}_\omega(r(\omega, k + z_i)) = 0$ for each $i$. Therefore

$$\mathbb{E}_\omega\prod_{\alpha=1}^{2q} \rho_{\omega,z}(n_\alpha) = 0 \text{ unless for every } m \in \mathbb{Z},\ \nu(m,\vec{n}) \ne 1.$$

We say that $\vec{n} \in \mathbb{Z}^{2q}$ is negligible if there exists at least one $m \in \mathbb{Z}$ satisfying $\nu(m,\vec{n}) = 1$. The number of nonnegligible multi-indices $\vec{n}$ is $\le C_{q,|I|}(AN)^q$. To prove this, given $\vec{n}$, partition the indices $1, 2, \dots, 2q$ into equivalence classes, by saying that $n_\alpha$ is equivalent to $n_\beta$ if there exist indices $i,j$ such that $n_\alpha + z_i = n_\beta + z_j$, and forming the smallest transitive relation $\equiv$ generated by these relations. Each $\vec{n}$ is thereby associated to a unique equivalence relation on $\{1, 2, \dots, 2q\}$. The number of such relations is a finite quantity, for each $q$. Consider all $\vec{n}$ associated to a given relation, with $c$ distinct equivalence classes. $c \le q$, since each equivalence class contains at least two elements. If $\{\beta\}$ is a collection of indices $\alpha$, with exactly one chosen from each equivalence class, then for every $\alpha \notin \{\beta\}$, $n_\alpha$ is determined from some $n_\beta$ by an equation $n_\alpha + z_j = n_\beta + z_i$. Therefore at most $(AN)^c$ values of $\vec{n}$ remain undetermined. Therefore there are at most $(AN)^c \le (AN)^q$ indices $\vec{n}$ associated to any given equivalence relation.
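The notion of a negligible multi-index can be illustrated by direct enumeration. In the sketch below the shifts $z_i$, the range of the $n_\alpha$, and the function name are hypothetical choices for a tiny instance with $2q = 2$.

```python
from collections import Counter
from itertools import product

def is_negligible(n, shifts):
    """True when some integer m satisfies nu(m, n) = 1, where nu(m, n) counts
    the pairs (alpha, i) with n_alpha + z_i = m."""
    counts = Counter(n_alpha + z for n_alpha in n for z in shifts)
    return any(count == 1 for count in counts.values())

shifts = (0, 1)           # distinct shifts z_i, so |I| = 2
R = 3                     # each n_alpha ranges over [-R, R]
survivors = [n for n in product(range(-R, R + 1), repeat=2)
             if not is_negligible(n, shifts)]
print(len(survivors), "nonnegligible pairs out of", (2 * R + 1) ** 2)
```

Here only the diagonal pairs $n_1 = n_2$ survive, so the count grows like the first power of the range, in line with the bound $C_{q,|I|}(AN)^q$ for $q = 1$.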

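If $\vec{n}$ is not negligible then

$$\Big|\mathbb{E}_\omega\prod_{\alpha=1}^{2q} \rho_{\omega,z}(n_\alpha)\Big| \le C_q(Np)^{-2q|I|}\prod_{m : \nu(m,\vec{n}) \ge 2} p = C_q(Np)^{-2q|I|}p^{\mu(\vec{n})}$$

where $\mu(\vec{n})$ is the number of $m \in \mathbb{Z}$ satisfying $\nu(m,\vec{n}) \ge 2$. Plainly $\mu(\vec{n}) \le 2q|I|/2 = q|I|$, so

$$\Big|\mathbb{E}_\omega\prod_{\alpha=1}^{2q} \rho_{\omega,z}(n_\alpha)\Big| \le C_q(Np)^{-2q|I|}p^{q|I|}.$$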
Summing over all nonnegligible $\vec{n}$ gives

$$\mathbb{E}_\omega|\widehat{\rho_{\omega,z}}(\xi)|^{2q} \le C_{q,|I|}(Np)^{-2q|I|}p^{q|I|}(AN)^q = C_{q,|I|,A}N^{q-2q|I|}p^{-q|I|},$$

as was to be proved.

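To derive (6.3) from (6.2), temporarily fix any $\xi \in \mathbb{T}$. For any $q < \infty$,

$$\begin{aligned}
\Big(\mathbb{E}_\omega\sup_z |\widehat{\rho_{\omega,z}}(\xi)|\Big)^q &\le \mathbb{E}_\omega\Big(\sup_z |\widehat{\rho_{\omega,z}}(\xi)|^q\Big) \le \mathbb{E}_\omega\Big(\sum_z |\widehat{\rho_{\omega,z}}(\xi)|^q\Big) \\
&= \sum_z \mathbb{E}_\omega\big(|\widehat{\rho_{\omega,z}}(\xi)|^q\big) \le C_{A,|I|}N^{|I|}\sup_z \mathbb{E}_\omega\big(|\widehat{\rho_{\omega,z}}(\xi)|^q\big) \\
&\le C_{A,|I|}N^{|I|}\,C_q\big(N^{-|I|+\frac12}p^{-|I|/2}\big)^q,
\end{aligned}$$

since at most $C_{A,|I|}N^{|I|}$ values of $z$ arise. Choosing $q = |I|/\varepsilon$ yields

$$\mathbb{E}_\omega\Big(\sup_z |\widehat{\rho_{\omega,z}}(\xi)|\Big) \le C_{\varepsilon,|I|}N^{-|I|+\frac12+\varepsilon}p^{-|I|/2}. \tag{6.9}$$

This is weaker than (6.3), in which $|\widehat{\rho_{\omega,z}}(\xi)|$ is replaced by $\|\widehat{\rho_{\omega,z}}\|_\infty$. But since $\rho_{\omega,z}$ is supported on an interval $[-AN,AN]$, by the Shannon sampling theorem

$$\|\widehat{\rho_{\omega,z}}\|_\infty \le C\max_j |\widehat{\rho_{\omega,z}}(\xi_j)|$$

where $\{\xi_j\} \subset \mathbb{T}$ is an arithmetic progression consisting of $KAN$ points with spacing $K^{-1}A^{-1}N^{-1}$, where $K$ is an absolute constant. Since such a progression consists of $O(N)$ points, the same reasoning used to introduce the supremum over $z$ in (6.9) also suffices to introduce the supremum over all $\xi_j$, at the expense of another factor of $N^{\varepsilon}$. Thus (6.3) follows from (6.9). □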

7. Extensions 

In this section we present an extension of Theorem 3.1, then show how Theorem 3.3 is an almost immediate consequence of this extension. Finally, we show how the application to return times of random subsequences is deduced from Theorem 3.3.

For $K \ge 1$ let $S_K \subset \mathbb{Z}^K$ be the set of all $z = (z_1, \dots, z_K) \in \mathbb{Z}^K$ satisfying

$$i \ne j \Rightarrow z_i \ne z_j. \tag{7.1}$$

Define $\rho(\omega,z) : \mathbb{Z} \to \mathbb{R}$ by

$$\rho(\omega,z)(x) = N^{K-1}\prod_{i=1}^K r(\omega, x + z_i). \tag{7.2}$$

Consider multilinear operators

$$T_{\omega,z}(f, g_1, \dots, g_M)(x) = \sum_y f(y)\,\rho(\omega,z)(L_0(x,y))\prod_{j=1}^M g_j(L_j(x,y)) \tag{7.3}$$

where the $L_j$ satisfy . . . Define

$$\|T_{\omega,z}\|_{op} = \sup \|T_{\omega,z}(f, g_1, \dots, g_M)\|_2$$

where the supremum is taken over all $f, \{g_j\}$ satisfying $\|f\|_2 \le 1$ and $\|g_j\|_\infty \le 1$. Since

$$\mathbb{E}_\omega\|\rho(\omega,z)\|_1 \asymp N^{K-1}(Np)^{-K} \cdot N \cdot p^K = 1,$$

the factor $N^{K-1}$ in the definition of $\rho(\omega,z)$ is the natural normalization here.

The same analysis as above establishes:

Theorem 7.1. There exist $\gamma = \gamma(M,K) > 0$ and $\delta > 0$, $C < \infty$ such that for all $p \ge N^{-\gamma}$,

$$\mathbb{E}_\omega\sup_{z \in S_K}\|T_{\omega,z}\|_{op} \le CN^{-\delta} \tag{7.4}$$

uniformly for all $N$.

Proof of Theorem 3.3. Introduce a function $\xi(x,z)$ so that

$$|T_{\omega,z}(e_{\xi(x,z)}f, g_1, \dots, g_M)(x)| \ge \tfrac12\sup_\xi |T_{\omega,z}(e_\xi f, g_1, \dots, g_M)(x)|$$

for all $x \in \mathbb{Z}$, where $e_{\xi(x,z)}$ denotes the function $y \mapsto e^{-i\xi(x,z)y}$. Now

(7.5) ^^(e^^)/,^,--- ,gM)\\l 

= E E ^ yM w > z )( L ( x ' f ) + «0) ^^^(y) n *>( L i (*> y)) 

w x,y j 

with the same notations for $f_w, g_{j,w}$ as in the beginning of the discussion of Theorem 3.1. Now in (7.5), set $g_{0,w}(x) = e^{-i\xi(x,z)w}$. Then $\|g_{0,w}\|_\infty = 1$. Regarding this expression as a multilinear form in $f_w, \{g_{j,w} : 0 \le j \le M\}$, it is in a form to which Theorem 7.1 applies, yielding the desired bound. □



Sketch of proof of Theorem 3.4. For $j \in \mathbb{N}$ let $N_j$ be the number of indices $k$ such that $n_k \in [1, 2^{j+1}]$. With probability $\ge 1 - e^{-c2^{\delta j}}$ for some $c, \delta > 0$, $N_j \asymp 2^{(1-\gamma)j}$. Moreover, $N_j - N_{j-1} \asymp N_j$, with similarly high probability.

We will sketch the proof of a weaker result, namely:

$$\lim_{j \to \infty} N_j^{-1}\sum_{k : n_k \in [1, 2^{j+1}]} f(\tau^{n_k}(x))\,g(\sigma^{n_k}(y)) \text{ exists,} \tag{7.6}$$

with the same quantifiers as in Theorem 3.4; the only distinction is that we average here only over initial segments $n_k \in [1, 2^j]$, rather than over arbitrary initial segments $n_k \in [1,N]$. The stronger conclusion stated in the theorem is proved by modifying the proof below, as follows: Partition $(2^{j-1}, 2^j]$ into subintervals of lengths $2^{(1-\eta)j}$ for sufficiently small $\eta > 0$, and augment the sequence $(2^j : j \in \mathbb{N})$ in the argument below by adjoining all endpoints of these subintervals. Details are left to the reader.

For $f \in \ell^p = L^p(\mathbb{Z})$ and $g \in \ell^q$ introduce the differences

$$\Delta_j^\omega(f,g)(x,y) = (N_j - N_{j-1})^{-1}\sum_{n_k \in (2^j, 2^{j+1}]} f(x + n_k)g(y + n_k) - 2^{-j}\sum_{n \in (2^j, 2^{j+1}]} f(x + n)g(y + n).$$

We will show momentarily that

$$\mathbb{E}_\omega\sup_f \Big\|\sup_g \|\Delta_j^\omega(f,g)(x,y)\|_{\ell^2_y}\Big\|_{\ell^p_x} \le C2^{-j\delta} \tag{7.7}$$

for some $\delta > 0$ and $C < \infty$, where the suprema are taken over all $f, g$ satisfying $\|f\|_{\ell^p} \le 1$ and $\|g\|_{\ell^2} \le 1$, respectively.

For $g \in \ell^\infty$, the same bound holds with $\delta$ replaced by $0$. A corresponding inequality with $\ell^2$ replaced by $\ell^q$ then follows for all $q \in (2,\infty)$, by interpolation between the endpoints $q = 2$ and $q = \infty$. It then follows by transference that a corresponding maximal inequality holds for arbitrary dynamical systems $(X,\tau)$ and $(Y,\sigma)$. This maximal inequality, together with the almost everywhere existence of the full averages (3.3) for $L^\infty$ functions, yields (7.6).

To establish (7.7), write $g$ in terms of its Fourier transform to represent



where $\mathbb{T} = \mathbb{R}/2\pi\mathbb{Z}$ and



Then 



mujit, x) = N- 1 Yl f( x + n k)^ nk - 

n fc e(2J',2J+ 1 ] 



$$\|\Delta_j^\omega(f,g)(x,y)\|_{\ell^2_y} \le C\sup_\xi |m_{\omega,j}(\xi,x)|$$

for every $g$ satisfying $\|g\|_{\ell^2} \le 1$, uniformly for every $\omega$.

Now

$$\sup_\xi |m_{\omega,j}(\xi,x)| = |T^*_{\omega,j}(f)(x)|$$

where $T^*_{\omega,j}$ is a maximal function of the type treated in Theorem 3.3, associated to the random set $\{n_k\} \cap (2^j, 2^{j+1}]$. Recall that this set was specified using independent random selector variables $s_n(\omega)$, such that $s_n(\omega) = 1$ with probability $\asymp n^{-\gamma}$; for $n \in (2^j, 2^{j+1}]$ this probability is $\asymp 2^{-j\gamma}$. Theorem 3.3 therefore applies, and asserts that $\mathbb{E}_\omega\|T^*_{\omega,j}(f)\|_{\ell^p} \le 2^{-j\delta}$ for a certain $\delta > 0$. □



8. A Variant 

In this section we discuss the variant in which the random variables are independent for distinct values of $(x,y)$, rather than depending only on some scalar-valued linear functional $L(x,y)$. Consider jointly independent random selector variables $s_\omega(x,y)$ for $(x,y) \in [-N, \dots, N]^2$, satisfying $s_\omega(x,y) = 1$ with probability $p$, and $= 0$ otherwise. Then $\mathbb{E}(\sum_x s_\omega(x,y)) \asymp Np$ and $\mathbb{E}(\sum_y s_\omega(x,y)) \asymp Np$. Define $r_\omega(x,y) = (Np)^{-1}(s_\omega(x,y) - p)$ so that $\mathbb{E}_\omega r_\omega(x,y) = 0$. Let $\mathcal{T}_\omega$ be the associated multilinear operators. We will sometimes write $\mathcal{T}_\omega^{(M)}$ to indicate the degree of multilinearity of $\mathcal{T}_\omega$.

The factor $(Np)^{-1}$ in the definition of $r_\omega$ represents the natural normalization, so that the expected value of the norm of the linear operator $f \mapsto \sum_y |r_\omega(x,y)|f(y)$, on $L^2([-N,N])$, is uniformly bounded. More precisely:

Lemma 8.1. For any $M \ge 2$, $A < \infty$, $\gamma_0 \in [0,1)$, and family $\{L_j\}$ satisfying the hypotheses of Theorem 3.1, there exists $C < \infty$ such that for any $\gamma \in [0,\gamma_0]$ and any index $i \in \{1, 2, \dots, M\}$, for any $N \ge 1$,

$$\mathbb{E}_\omega\Big(\Big[\sup_{f_1, \dots, f_M}\ \sum_{(x,y) \in [-AN,AN]^2} |r_\omega(x,y)|\prod_{j=1}^M |f_j(L_j(x,y))|\Big]^2\Big) \le C\log(N)^2,$$

where the supremum is taken over all functions satisfying $\|f_i\|_1 \le 1$ and $\|f_j\|_\infty \le 1$ for all $j \ne i$.
Sketch of proof. At the expense of a factor depending on $\{L_j\}$, we may change variables so that $L_i(x,y) = x$. Then

$$\sum_y |r_\omega(x,y)|\prod_{j=1}^M |f_j(L_j(x,y))| \le |f_i(x)|\sum_y |r_\omega(x,y)|.$$

An application of Chernoff's inequality (see below for a similar argument) yields

$$\mathbb{E}_\omega\sup_{x \in [-AN,AN]}\sum_y |r_\omega(x,y)| \le C\log(2 + AN),$$

and the same for the expectation of the square. □

$\mathbb{E}_\omega(\sum_y |r_\omega(x,y)|)$ is also bounded below by a strictly positive constant, independent of
$x$. Since the random variables $\sum_y |r_\omega(x,y)|$ are jointly independent, it is easily seen that
$\mathbb{E}_\omega(\sup_x \sum_y |r_\omega(x,y)|)$ is not uniformly bounded as $N \to \infty$.

Theorem 3.5 will be proved by induction on the degree $M$ of multilinearity. The following
base result will be proved later.

Lemma 8.2. For any $\varepsilon > 0$, $\mathbb{E}_\omega \|T_\omega^{(2)}\|_{\mathrm{op}} \le C_\varepsilon N^{\varepsilon} N^{-(1-\gamma)/2}$.

We turn to the proof of Theorem 3.5. Let $M \ge 3$, and $\gamma \in [0,1)$. By a simple interpolation,
it suffices to prove the inequality under the assumption that each function $f_j$ equals
the characteristic function $\chi_{E_j}$ of a set $E_j \subset [-N,N]$. We will simplify notation by writing
$T_\omega(E_1,\dots,E_M)$ for $T_\omega(\chi_{E_1},\dots,\chi_{E_M})$. Introduce the restricted weak type norm

\[ \|T_\omega\|_{\mathrm{weak}} = \sup_{E_1,\dots,E_M} |E_1|^{-1/2}|E_2|^{-1/2}\,|T_\omega(E_1,\dots,E_M)|. \tag{8.1} \]

Suppose now that the theorem has been proved for $M - 1$. Therefore for $E_M = [-N,N]$,

\[ \mathbb{E}_\omega \sup_{\{E_1,\dots,E_{M-1}\}} |E_1|^{-1/2}|E_2|^{-1/2}\,|T_\omega(E_1,\dots,E_{M-1},[-N,N])| \le C N^{\varepsilon-(1-\gamma)/2} \tag{8.2} \]

where $C$ depends on $\varepsilon$, $M$, $\gamma$.

Lemma 8.3. For any $\eta \in (0,1)$ and for any $\omega$,

\[ \|T_\omega^{(M)}\|_{\mathrm{weak}} \le C N^{-\eta/2} + \sup\nolimits^{*} \cdots \tag{8.3} \]

where $\sup^{*}$ denotes the supremum over all $M$-tuples of sets $E_j$ satisfying

\[ |E_1| \cdot |E_2| \ge N^{2-\eta}. \]

Proof. Denote by $\mathbf{1}$ the constant function $\mathbf{1}(x) = 1$ for all $x \in [-N,N]$. Define the
nonrandom averaging forms

\[ A(f_1,\dots,f_M) = N^{-1} \sum_{x,y} \prod_{j=1}^{M} f_j(L_j(x,y)). \]

As for $T$, write $A(E_1,\dots,E_M)$ when each $f_j$ is the characteristic function of a set $E_j$.
Then

\[ |A(E_1,\dots,E_M)| \le C|E_1|^{1/2}|E_2|^{1/2} = C\|f_1\|_2\|f_2\|_2 \prod_{k>2} \|f_k\|_\infty, \]

but the trivial bound

\[ A(E_1,\dots,E_M) \le A(E_1,E_2,\mathbf{1},\dots,\mathbf{1}) \le CN^{-1}|E_1|\cdot|E_2| = C\big(|E_1|^{1/2}|E_2|^{1/2}/N\big)\cdot|E_1|^{1/2}|E_2|^{1/2} \tag{8.4} \]

expresses a significant improvement unless $|E_1|\cdot|E_2| \gtrsim N^2$.

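$A(f_1,\dots,f_M)$ never decreases if all functions are replaced by their absolute values; nor
does it decrease if some $f_j$ increases, provided that all $f_i$ are nonnegative. The same holds
for $(T_\omega + A)(f_1,\dots,f_M) = (Np)^{-1}\sum_{x,y} s_\omega(x,y)\prod_j f_j(L_j(x,y))$, since
$r_\omega = (Np)^{-1}(s_\omega - p)$. Therefore if $\|f_j\|_\infty \le 1$ for all $j \notin \{1,2\}$, then

\begin{align*}
|T_\omega(f_1,\dots,f_M)| &\le |A(f_1,\dots,f_M)| + |(T_\omega + A)(f_1,\dots,f_M)| \\
&\le A(|f_1|,\dots,|f_M|) + (T_\omega + A)(|f_1|,|f_2|,\mathbf{1},\dots,\mathbf{1}) \\
&\le 2A(|f_1|,|f_2|,\mathbf{1},\dots,\mathbf{1}) + T_\omega(|f_1|,|f_2|,\mathbf{1},\dots,\mathbf{1}).
\end{align*}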

Write $\mathbf{1}$ to denote the characteristic function of $[-N,N]$, as well as this set itself. Let
$\rho < (1-\gamma)/2$. Then by induction,

\[ \mathbb{E}_\omega \sup_{\{E_m :\, m \le M-1\}} |E_1|^{-1/2}|E_2|^{-1/2}\,|T_\omega(E_1,\dots,E_{M-1},\mathbf{1})| \le CN^{-\rho}. \]

Therefore by replacing $E_M$ by its complement $[-AN,AN] \setminus E_M$ if necessary, we may assume
without loss of generality that

\[ \sum_{x,y} \prod_{j=1}^{M} \chi_{E_j}(L_j(x,y)) \ge \frac12 \sum_{x,y} \prod_{j=1}^{M-1} \chi_{E_j}(L_j(x,y)). \tag{8.5} \]

Applying this argument to the indices $m = M-1, M-2, \dots$ in sequence, we reduce to
the case where the set

\[ \mathcal{E} = \{(x,y) : L_j(x,y) \in E_j \text{ for all } j \in [1,M]\} \subset E_1 \times E_2 \tag{8.6} \]

satisfies

\[ |E_1| \cdot |E_2| \le 2^M |\mathcal{E}|. \tag{8.7} \]

□

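For any set $\mathcal{E}$ consider the random variable

\[ X_{\mathcal{E}}(\omega) = Np \sum_{(x,y)\in\mathcal{E}} r_\omega(x,y) = \sum_{(x,y)\in\mathcal{E}} (s_\omega(x,y) - p). \tag{8.8} \]

$\mathbb{E}_\omega X_{\mathcal{E}}(\omega) = 0$. The summands $s_\omega(x,y) - p$ are jointly independent, with values in $[-1,1]$.
$X_{\mathcal{E}}$ has standard deviation $\sigma \asymp p^{1/2}|\mathcal{E}|^{1/2}$, with implicit constants depending on $\gamma_0$ but not
on $N$.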

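Chernoff's inequality [7] asserts that $\Pr(|X_{\mathcal{E}}(\omega)| > \lambda\sigma) \le Ce^{-c\min(\lambda^2,\lambda\sigma)}$. Set

\[ \lambda = Np \cdot N^{-\rho}|\mathcal{E}|^{1/2}\sigma^{-1} \asymp N^{1-\rho}p^{1/2} \asymp N^{1-\rho-\gamma/2}. \tag{8.9} \]

Then

\[ \min(\lambda^2,\lambda\sigma) = \min\big(N^{2-2\rho-\gamma},\, N^{1-\rho-\gamma}|\mathcal{E}|^{1/2}\big) \ge c\min\big(N^{2-2\rho-\gamma},\, N^{2-\rho-\gamma-\eta/2}\big). \]

Moreover

\[ (Np)^{-1}\lambda\sigma = N^{-\rho}|\mathcal{E}|^{1/2} \le C_M N^{-\rho}|E_1|^{1/2}|E_2|^{1/2}. \]

Consider the exceptional event

\[ \Omega^*_M(\mathcal{E}) = \{\omega \in \Omega : |X_{\mathcal{E}}(\omega)| > \lambda\sigma\}. \]

By the definition of $\lambda$,

\[ |T_\omega(E_1,\dots,E_M)| \le N^{-\rho}|E_1|^{1/2}|E_2|^{1/2} \quad \text{for all } \omega \notin \Omega^*_M(\mathcal{E}). \tag{8.10} \]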
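Choose $\eta = 2\rho$. Since $\rho < (1-\gamma)/2$, both exponents above equal $2 - 2\rho - \gamma > 1$, and we conclude that

\[ \Pr(\Omega^*_M(\mathcal{E})) \le Ce^{-cN^{1+\delta}} \tag{8.11} \]

for some $\delta > 0$.
Define

\[ \Omega^*_M = \bigcup_{\mathcal{E}} \Omega^*_M(\mathcal{E}). \]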

The total number of sets $\mathcal{E}$, of all cardinalities, is at most $2^{CMN}$, because $\mathcal{E}$ is uniquely
determined by $E_1 \times \cdots \times E_M$. So

\[ \Pr\Big(\bigcup_{\mathcal{E}} \Omega^*_M(\mathcal{E})\Big) \le C2^{CMN}e^{-cN^{1+\delta}}, \]

and consequently $\Pr(\Omega^*_M) \le CN^{-\delta}$ for another $\delta > 0$. It follows from Lemma 8.1 and
Hölder's inequality that

\[ \int_{\Omega^*_M} \|T_\omega^{(M)}\|_{\mathrm{op}} \le Ce^{-cN^{1+\delta}} \tag{8.12} \]

for some $C, c, \delta \in \mathbb{R}^+$. Since $\eta = 2\rho$, (8.12) and (8.10) and Lemma 8.3 together give
$\mathbb{E}_\omega(\|T_\omega^{(M)}\|_{\mathrm{weak}}) \le CN^{-\rho}$. This completes the inductive step. □

Proof of Lemma 8.2. For any linear operator $Tf(x) = \sum_y K(x,y)f(y)$ with real kernel $K$,

\[ \|T\|_{\mathrm{op}}^{2K} \le \operatorname{trace}\big((T^*T)^K\big) = \sum K(x_1,y_1)K(x_2,y_1)K(x_2,y_2)K(x_3,y_2)\cdots K(x_K,y_K)K(x_1,y_K), \]

where the sum is taken over all $2K$-tuples $(x_1,y_1,\dots,x_K,y_K)$. Apply this with $K(x,y) =
K_\omega(x,y) = (Np)\,r_\omega(x,y)$. Fix $(x_1,y_1,x_2,y_2,\dots,x_K,y_K)$. Define the multiplicity of $(s,t)$
to be the number of factors $K_\omega(x_i,y_j)$ in this product for which $(x_i,y_j) = (s,t)$; here $j = i$,
or $j = i - 1$, or $j = K$ and $i = 1$.

To $(x_1,y_1,x_2,y_2,\dots,x_K,y_K)$ is associated a nonincreasing partition of $2K$, namely the
ordered tuple of all nonzero multiplicities of elements $(s,t)$ of $\mathbb{Z}^2$, written in nonincreasing
order. We denote such a partition by $(m_1,\dots,m_J)$, where $\sum_{j=1}^{J} m_j = 2K$.

The expectation of $K_\omega(x_1,y_1)K_\omega(x_2,y_1)K_\omega(x_2,y_2)\cdots K_\omega(x_1,y_K)$ vanishes unless no
$(s,t) \in [1,\dots,N]^2$ has multiplicity equal to one. Therefore only partitions with all $m_j \ge 2$
contribute to the expectation. The number $J$ of summands $m_j$ is then $\le K$.

Lemma 8.4. The number of points $(x_1,y_1,x_2,y_2,\dots,x_K,y_K) \in [1,N]^{2K}$ which give rise
to any particular partition $(m_1,\dots,m_J)$ is $\le C_K N^{J+1}$.

This will be proved below. 

The number of possible partitions is a function of $2K$.

\[ \mathbb{E}_\omega\big(K_\omega(x_1,y_1)K_\omega(x_2,y_1)K_\omega(x_2,y_2)\cdots K_\omega(x_1,y_K)\big) \le C^K \prod_j p = C^K p^J. \]
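The two facts used here, that a factor of multiplicity one kills the expectation and that each distinct point contributes at most a factor comparable to $p$, follow from the exact centered moments of a Bernoulli variable. A minimal sketch under that reading (the function names are illustrative and not from the paper): for $s \sim \mathrm{Bernoulli}(p)$, $\mathbb{E}(s-p)^m = p(1-p)^m + (1-p)(-p)^m$, which is $0$ for $m = 1$ and has absolute value at most $p$ for $m \ge 2$.

```python
def centered_moment(p, m):
    """Exact value of E[(s - p)^m] for s ~ Bernoulli(p)."""
    return p * (1 - p) ** m + (1 - p) * (-p) ** m

def expected_product(p, multiplicities):
    """Expectation of a product of factors (s(z) - p) over independent points z,
    where the point with index j occurs multiplicities[j] times."""
    result = 1.0
    for m in multiplicities:
        result *= centered_moment(p, m)
    return result

if __name__ == "__main__":
    p = 0.1  # illustrative value
    print(expected_product(p, (2, 1, 1)))  # a multiplicity-one point: expectation vanishes
    print(expected_product(p, (2, 2)), "<=", p ** 2)
```

By independence the expectation factors over the $J$ distinct points, which is what produces the bound $p^J$ in this sketch.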

The product $C^K p^J \cdot N^{J+1}$ is $\le C^K N(Np)^K$ since $J \le K$. Summing these upper bounds
for expected values over all $(x_1,y_1,x_2,y_2,\dots,x_K,y_K)$ associated to a given partition, then
summing over all partitions, yields

\[ \mathbb{E}_\omega \operatorname{trace}\big((T^*T)^K\big) \le C_K N(Np)^K, \tag{8.13} \]

whence $T_\omega = T_\omega^{(2)}$ satisfies $\mathbb{E}_\omega\|T_\omega\|_{\mathrm{op}} \le C_K (Np)^{-1}N^{1/K}(Np)^{1/2} = C_K N^{1/K}(Np)^{-1/2}$.
Since $K$ may be taken to be arbitrarily large, this establishes Lemma 8.2. □
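The deterministic inequality $\|T\|_{\mathrm{op}}^{2K} \le \operatorname{trace}((T^*T)^K)$ behind this argument can be checked on a sampled kernel. The sketch below is an illustration only: the helper names, the small grid, and the use of power iteration are assumptions made for the example and are not part of the paper's proof. It builds $G = K_\omega^{T}K_\omega$ for $K_\omega = s_\omega - p$, estimates the top eigenvalue of $G$ (which equals $\|T\|_{\mathrm{op}}^2$) from below by a Rayleigh quotient, and compares its $K$-th power with $\operatorname{trace}(G^K)$.

```python
import random

def matmul(A, B):
    """Product of two dense matrices stored as lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]

def gram(kernel):
    """G = K^T K, the matrix of T*T for a real kernel K."""
    kernel_t = [list(col) for col in zip(*kernel)]
    return matmul(kernel_t, kernel)

def trace_of_power(G, K):
    """trace(G^K) for an integer K >= 1."""
    P = G
    for _ in range(K - 1):
        P = matmul(P, G)
    return sum(P[i][i] for i in range(len(P)))

def top_eigenvalue_estimate(G, iters=300, seed=0):
    """Rayleigh quotient after power iteration: a lower estimate of lambda_max(G)."""
    rng = random.Random(seed)
    v = [rng.uniform(-1.0, 1.0) for _ in G]
    for _ in range(iters):
        w = [sum(g * x for g, x in zip(row, v)) for row in G]
        norm = sum(x * x for x in w) ** 0.5
        if norm == 0.0:
            return 0.0
        v = [x / norm for x in w]
    Gv = [sum(g * x for g, x in zip(row, v)) for row in G]
    return sum(x * y for x, y in zip(v, Gv))

def random_kernel(N, p, seed=0):
    """K(x, y) = s(x, y) - p on [-N..N]^2 with i.i.d. Bernoulli(p) selectors s."""
    rng = random.Random(seed)
    side = 2 * N + 1
    return [[(1.0 if rng.random() < p else 0.0) - p for _ in range(side)]
            for _ in range(side)]

if __name__ == "__main__":
    G = gram(random_kernel(8, 0.3))  # illustrative parameters
    lam = top_eigenvalue_estimate(G)
    for K in (1, 2, 4):
        print(K, lam ** K, "<=", trace_of_power(G, K))
```

Because $G$ is positive semidefinite, $\operatorname{trace}(G^K)$ is the sum of the $K$-th powers of its eigenvalues, so it dominates the $K$-th power of the largest one; taking $K$ large makes the loss in this step small.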
Proof of Lemma 8.4. Write $x_{K+1} = x_1$ to facilitate the discussion. If $J = 1$ then $(x_i,y_i) =
(x_1,y_1)$ for all $2 \le i \le K$, and there are $N^2 = N^{J+1}$ possible values of $(x_1,y_1)$. If
$J > 1$, set $z_1 = (x_1,y_1)$, $z_2 = (x_2,y_1)$, $z_3 = (x_2,y_2)$, $z_4 = (x_3,y_2)$, $\dots$, $z_{2K-1} = (x_K,y_K)$,
$z_{2K} = (x_1,y_K)$. To the partition $(m_1,m_2,\dots,m_J)$ of $2K$ we associate all possible
equivalence relations on $\{z_k : 1 \le k \le 2K\}$ such that there are $J$ equivalence classes, with
$m_1,m_2,\dots,m_J$ elements. Such an equivalence relation is said to be feasible if there exist
values of the $z_k \in [1,N]^2$ such that $z_l = z_k$ if and only if $z_l, z_k$ belong to the same
equivalence class. The number of equivalence relations is a function of $K$ alone, so it suffices to
bound the number of points $(x_1,y_1,x_2,y_2,\dots,x_K,y_K) \in [1,N]^{2K}$ which give rise to one
equivalence relation.

Consider any feasible equivalence relation associated to the partition $(m_1,\dots,m_J)$.
Choose some equivalence class with $m_1$ elements $z_k$. Choose two coordinates, $x_i$ and $y_i$ or
$x_{i+1}$ and $y_i$, which determine all $z_k$ in this class. These coordinates are said to be free,
while any $x_l$ or $y_l$ which is one of the two coordinates of some $z_k$ in this class, is said to be
bound. Thus the first equivalence class accounts for exactly two free coordinates.

There must exist either $z_k = (x_i,y_i)$ in this class such that $(x_{i+1},y_i)$ does not belong to
this class, or $z_k = (x_{i+1},y_i)$ such that $(x_{i+1},y_{i+1})$ does not belong to this class; otherwise the
class would include every $z_k$, which is impossible since $J > 1$. In the first case, $(x_{i+1},y_i)$
belongs to a second equivalence class. The coordinates of any other $z_k$ in this second class
are determined by $x_{i+1}, y_i$. Here $y_i$ is a coordinate of some element of the first class, while $x_{i+1}$
cannot be a coordinate of some element of the first class, since otherwise $(x_{i+1},y_i)$ would belong
to that class. Designate $x_{i+1}$ to be a free coordinate; all coordinates of all other $z_l$ in
the second class are determined by $x_{i+1}$ and $y_i$, hence by $x_{i+1}$ together with the two free
coordinates associated to the first class. Thus three free coordinates (together with the
equivalence relation itself) are required to determine all coordinates of all points in the
union of the first two classes. Repeating this reasoning, we obtain if $J > 2$ a third class and
one additional free coordinate, and so on. Proceeding through all $J$ classes, a total of $J + 1$
free coordinates are obtained. Each of these coordinates can take on $N$ values, so in all
there are at most $N^{J+1}$ possible points associated to an individual equivalence relation associated
to a partition with $J$ elements. □
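The count in Lemma 8.4 can be verified by brute force for very small parameters. The following sketch is only an illustration (function names and parameter choices are assumptions): it enumerates all points of $[1,N]^{2K}$, forms the cyclic list $z_1 = (x_1,y_1), z_2 = (x_2,y_1), \dots, z_{2K} = (x_1,y_K)$, and tallies the points by the partition of $2K$ given by the multiplicities of the distinct $z_k$.

```python
from collections import Counter
from itertools import product

def partition_of(point, K):
    """Nonincreasing tuple of multiplicities of the pairs z_1, ..., z_{2K}
    built from point = (x_1, y_1, ..., x_K, y_K), with x_{K+1} = x_1."""
    xs, ys = point[0::2], point[1::2]
    zs = []
    for i in range(K):
        zs.append((xs[i], ys[i]))            # z_{2i+1} = (x_{i+1}, y_{i+1}) in 1-based terms
        zs.append((xs[(i + 1) % K], ys[i]))  # z_{2i+2}: next x, same y (cyclic in x)
    return tuple(sorted(Counter(zs).values(), reverse=True))

def count_by_partition(N, K):
    """Number of points of [1, N]^{2K} giving rise to each partition of 2K."""
    counts = Counter()
    for point in product(range(1, N + 1), repeat=2 * K):
        counts[partition_of(point, K)] += 1
    return counts

if __name__ == "__main__":
    N, K = 3, 2
    for part, count in sorted(count_by_partition(N, K).items()):
        J = len(part)
        print(part, count, "N^(J+1) =", N ** (J + 1))
```

For $K = 2$ the counts are $N^2$ for the partition $(4)$, $2N^2(N-1)$ for $(2,2)$, and $N^2(N-1)^2$ for $(1,1,1,1)$, each at most $2N^{J+1}$, in agreement with the lemma.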